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Motivated by applications to distributed storage, Gopalan et al recently introduced the 
interesting notion of information-symbol locality in a linear code. By this it is meant that 
each message symbol appears in a parity-check equation associated with small Hamming 
l3 ■ weight, thereby enabling recovery of the message symbol by examining a small number of 

other code symbols. This notion is expanded to the case when all code symbols, not just the 
message symbols, are covered by such "local" parity. In this paper, we extend the results of 
Gopalan et. al. so as to permit recovery of an erased code symbol even in the presence of 
errors in local parity symbols. We present tight bounds on the minimum distance of such 
codes and exhibit codes that are optimal with respect to the local error-correction property. 
^\ As a corollary, we obtain an upper bound on the minimum distance of a concatenated code. 



I. Introduction 



■<sj- 1 In IIJ, Gopalan et al introduced the interesting and practically relevant notion of locality of infor- 
mation. The z*^ code-symbol Cj, 1 < i < n, of an [n,k,d\ linear code C over the field Fg is said to 

^ . have locality r if this symbol can be recovered by accessing at most r other code symbols of code C. 

^ l Equivalently, for any coordinate i, there exists a row in the parity-check matrix of the code of Hamming 

; weight at most r + 1, whose support includes i. An (r, d) code was defined as a systematic linear code 

CN ■ C having minimum distance d, where all k message symbols have locality r. It was shown that the 

T1 " minimum distance of an (r, d) code is upper bounded by 



^ ■ d < n-k 



2. (1) 



A class of codes constructed earlier and known as pyramid codes ^ are shown to be (r, d) codes that 
are optimal with respect to this bound. 

The concept of an (r, d) code was motivated by the problem of designing efficient codes for the 
distributed storage of data across nodes in a network. Since nodes are prone to failure, there is need to 
protect the data using an error-correcting code. A second important requirement in this setting, is the 
ability to efficiently bring up a failed node. Here, (r, d) codes offer the advantage that in the event of 
a single node failure, the node can be locally recovered by connecting to at most r other nodes. 

A natural extension to the concept of an (r, d) code, is a code that would allow local recovery of 
a failed node, even in the presence of failures in other nodes of the network. Multiple node failures 
are not uncommon in distributed data storage, and a number of coding schemes for tolerating such 
multiple node failures exist in practice [i2Jll3J[i4j. This motivates the definition of the class of {r,d,6) 
local-error-correction (LEG) codes given below. 

Definition 1: The ith code symbol q, 1 < i < n, in an [n,k,d] linear code C, will be said to 
have locality (r, 5) if there exists a punctured subcode of C with support containing i, whose length 



is at most r -\- 6 — 1, and whose minimum distance is at least 6. Equivalently, there exists a subset 
S'i [n] = {1, . . . ,n} such that 

• i e Si and \Si\ < r + S — 1, 

• the minimum distance of the code CI5. obtained by deleting code symbols q, i G [njyS'i, is least 



Since the dual of a punctured code is a shortened code, this also implies that we may regard the 
parity-check matrix H of the code as containing for some Ui,l < Ui < n — k, a (ui x n) submatrix Hi 
having rank z/j, support Si, and the property that any 6—1 columns of Hi with indices drawn from Si, 
are linearly independent. 

A systematic [n, k, d] linear code C will be said to be an (r, 6)i code, if all k message (or information) 
symbols have locality (r, 5). We will also refer to such a code as having information locality (r, 5). It 
is clear that if we employ an (r, 6)i code for the distributed storage of data, a systematic node can be 
locally repaired by connecting to r other nodes, even if 5 — 2 other nodes fail. An additional advantage 
of an (r, S)i code is that even when the other nodes are intact, the code provides multiple options for 
locally repairing a failed systematic node, which in a network setting, can be used to balance traffic 
across the network Q. The {r,d) codes introduced by Gopalan et al correspond to (r, 2)j codes in the 
present notation. 

By using properties of the generalized Hamming weights |l5l of a code (also known as minimum 
support weightsjH), we will show that the minimum distance of an (r, 6)i is upper bounded (Theorem [2l) 



As was the case with the (r, d) codes introduced in |[T1, a class of pyramid codes turns out to provide 
examples of optimal (r, S)i codes, i.e., (r, S)i codes is which the bound in ^ is achieved with equality. 
For the special case when r\k, we will identify conditions that the parity check matrix of an optimal 
(r, S)i code must necessarily satisfy. 

We will term a code in which all the n symbols of an [n,k, d] code have locality (r, 5) as codes as 
having all-symbol locality (r, 6) and denote such codes as (r, 6)a codes. Thus, whenever we speak of 
either an (r, d)i or else an (r, S)a code, it will be assumed that the length, dimension and minimum 
distance of the linear code are understood from the context and are typically denoted by n, k, d 
respectively. Clearly, codes with all-symbol locality are a subset of the set of codes with just information 
locality. Nevertheless, it turns out that when {r + 6 — l)\n, one can show the existence of codes with 
all-symbol locality {r,6), which satisfy the upper bound on minimum distance given in We will 
also present an explicit code having all-symbol locality, for the case when the code length is of the 
form n= \^]{r + 6 - 1). 

Through out this write up, we will assume without loss of generality, that the [n, k, d] code C under 
study, is systematic, with information symbols present in the first k coordinates. For a codeword c e C, 
we will use supp(c) to denote the support {? G [n] | Cj 7^ 0} of the codeword. The support of a subcode 
V of C, is defined by supp(r') = Uceosupp(c). For a set 5" C [n], we will use C\s and to denote 
respectively, the punctured and shortened codes of C associated with the coordinate set S. By this we 
mean that under either the puncturing or shortening operation, the coordinates of the code lying in [n]\S 
are suppressed. Also, for any set S, the cardinality of the set will be denoted by IS"]. 

Section |ll] presents background on generalized Hamming weights, while codes with information and 
all-symbol locality are treated in Sections UlI] and |IV] respectively. In the final section. Section |Vl we 
present as a corollary, an upper bound on the minimum distance of a concatenated code. 



5. 



by 




(2) 



'By connecting to any r out of the r + S — 2 nodes which locally protect the failed node, one can recover the failed node. 



II. Generalized Hamming weights 

In this section, we review the definition of the generalized Hamming weight (GHW) of a code [|5l, 
[|71 and see how the GHWs of a code are related to those of its dual. We introduce the notion of a gap 
which will play an important role in our subsequent proofs. 

Definition 2: The i*'^, 1 < z < A;, generalized Hamming weight of a code C is defined by 

di{C) = di = mm |Supp(P)|, (3) 

dim(r>)=i 

where V < C, is used to denote a subcode V of C. 
It is well known that 

d = di < d2 < . ■ ■ < dk = n. (4) 

We will call the complement of the set {di, I < i < k}, in [n], as the set of gap numbers (more simply, 
gaps) of the code C and denote them by the set {gi, 1 < i < n — k}, where 

{9^, l<i<n-k} = [n]\ {d„ l<i<k}. (5) 

Similarly, let the sets {dj-, 1 < j < n — k} and {g^, I < i < k} respectively denote the GHWs and 
gaps of the dual code C . The following lemma |l5l relates the GHWs of C to those of C"*". 

Lemma 1: 

{dul<i<k] = [n]\{n+l-df, l<j<n-k}. (6) 

In terms of the gaps of the dual code C"*-, Q can be rewritten as 

d, = {n + l)~gt,^„ l<i<k. (7) 
In particular, the minimum distance d oi C and the largest gap g^ of C"*" are related by 

d = d^ = {n+l)-gi. (8) 
This relation will be used to derive an upper bound on the minimum distance of (r, 5)i codes. 

III. Codes with information locality 

In this section. Theorem [21 will establish the upper bound appearing in (O, on the minimum distance 
of (r, S)i codes. It will then be shown that pyramid codes, under an appropriate choice of parameters, 
are optimal with respect to this bound. Necessary conditions for optimality of an (r, S)i code for the 
case when r\k, are identified in Theorem [6l 

Theorem 2: The minimum distance d of an (r, 5)i code C is upper bounded by 

'k 



d < n-k+1 



1 (5-1). (9) 



Proof: From ([8]), the minimum distance of C, in terms of the largest gap of C"*- is given by 

d = {n + l)-gi. (10) 

The desired upper bound on d will be obtained by showing the corresponding lower bound on g^. This 
lower bound on g^ will in turn, be deduced from an appropriate upper bound on the ([^] — l) {6— 1)*'^ 



GHW, (i|rn N,. of C"^. It will be established in the next subsection, that under the conditions of 
Theorem 111 



Let df^,y^p-l) 



kl \ 

- -1 (5-1) < n-k, 
r J 

s. Then the number of gaps in the dual that do not exceed s is given by 



(11) 



< 



r J 
— r < k. 



(12) 



-1 (5-1) 



(13) 



Since there are a total of k gaps in the dual code C^, there must be at least an additional k — 
[s — ([7] — 1) (5 — 1)] gaps that exceed s and hence the last gap in the dual, g^, satisfies the lower 
bound: 



gi > s + k 
= k + 

Combining (O and (QOl), we get 





'k 




H 






r 





-1 (5-1). 



(14) 



A Proof of ^ 

We begin with a useful lemma. 

Lemma 3: Let C be a systematic [n, k, d] linear code whose first k coordinates correspond to message 
symbols. Let be a subset of [n] of size s, such that [k] C S. Let V denote a sub code, supported on 
5", of the dual code C-*-, i.e., every code symbol in every codeword in V is zero outside of S. Also, let 
Q = [^mxfel^mx(n-fc)] with m > JO, bc a rank p, (m x n) generator matrix for V. Then we must have 
rank(S) = p and hence s — k > p. 

Proof: Suppose rank(5) < p. Then the row space of Q would contain nonzero vectors in its row 
space which are supported (i.e., nonzero in) only in the first k message symbol coordinates. This is 
not possible as this would imply a relationship amongst the message symbols of the code C. Hence 
rank(5) = p. We also know that the number of nonzero columns in B is less than or equal to s — k. 
It follows that s — k > p. ■ 



We are now ready to prove that 



-1 (5-1) < n-k 



and 



'^aivi)(^-i) - 



1 (r + 5-l). 



For i e [k], let the i^'^ code (message) symbol be locally protected by a code associated to the parity 
check matrix Hi, whose support is Si of size \Si\ = Si < r + 6 — 1. Let Vi denote the row space of Hi 
and let Ui be its dimension. Since the null space of Hi must define a code whose minimum distance is 
greater than or equal to 6, we must have that t'j > 5 — 1, Vi G [/c]. Let us set \E' = U^Lj^S'j and s := |\E'|. 
Let a be the largest integer such that there exists a subset {Vi.}j^i with the property that if 

Wa = V,, + Vi, + ---V,^, (15) 

then for every jo, I < jo < a, we have 

dim{Wa) - dim [ ^ Vi^ j > 6-1. (16) 

In other words, each subspace Vi. contributes at least ((5 — 1) to the total dimension. Clearly, such 
an a exists, for a > 1 is trivially true. Without loss of generality, we reorder the indices so that 

V,^=V„l<j <a. 

We next define Wo = {0}, ^E'o = and for 1 < i < a, 

i 

^. = U)^,Sj, Wi = J2Vj (17) 

i=i 

Aui = dim(Wi) - dim{Wi.i), Asi =| | . 

Clearly, 

Au, > (6-1) (18) 
Asi < {r + 5-1). 

We now examine each subspace for z = a + 1, a + 2, ■ ■ ■ , in turn. Set 

dim{Vi + Wa)-dim{Wa) = Avi 

\Si\^a\ = As,. 

Clearly we must have 

Au, < (6-2) 

As, < Aui. (19) 

The second property follows since any subset of {6 — 1) or less columns of each matrix Hi forms a 
linearly independent set. If either Aui = or As, = we can discard Vi, without affecting the locality 
property. Let iq > a he the first index that has not been discarded. We reorder the indices of the 
remaining Vi so that the indices of Vi,l < i < a remain unchanged and = Va+i and set 

Wa+l = Wa + Va+l 

Then 

dim{Wa+i) - dim{Wa) = Au^+i 

\^a+l\^a\ = ASa+l- 

Continuing in this fashion with a replaced by (a + 1), we will eventually arrive at Wa+b and '^a+b 
with 

dim(M^,+,) -dim(iy,+(,„i)) = Az/,+, < (5-2) 

I ^a+i \ ^a+(i-l) I = ASa+i < AUa+i, (20) 



for 1 < z < 6 < A; — a. Let 



H 



Hi 
H2 



H. 



a+b 



Then 



a+b 

rank(//) = ^Au, 

1=1 

Supp{H) = ^a+b 

a 

Supp(iy) I < ^Asi+^Asi 



a+b 



i=l 



i=a+l 



a+b 



< a{r + {5-l))+ J2 ^^i- 



i=a+l 

We are now in a position to apply Lemma [3l This is because the row space of the matrix H can be 
regarded as a sub code, supported on ^a+b, of the dual code C"*-. Hence, from Lemma [3l it must be 
that: 

Supp(//) > k + mnk{H) 

a+b 

= k + ^AUi. 

i=l 

From the two expressions above for the size of the support of H, we obtain that 

a+b a+b 

a{r + {6-l))+ J2 ^ k + a{6-l)+ J2 (21) 



i=a+l 



i=a+l 



a+b 



ar 



- + X] ^^^^ ~ 



i=a+l 



a > 



(22) 



where (|2TI) and (l22l) follow from (fTSl ) and (|20l) . respectively. 

It follows that the rank(i/) > a{5 - 1) > {6 - - 1). Also, since rank(i^) < (n - A;), we get 

that 

(n-k) > {6-m-]-l), 
r 

and it is hence meaningful to speak of dj-^k-^ ^-j^^ Since the support of each submatrix Hi is < 
(r + 6 — 1), we have that 



<'(r|i-i)(«) S ^r-l-ij(r + *-i). 

and with this, we have recovered the two inequalities appearing in (fTTI) . 



Corollary 4: For an (r, 5)i code C that achieves the bound in ^ with equality, we have 

'k' 



d 



(rivi)(5-i)+i 



1 + 



(23) 



for 1 < 2 < n - A; - (((5 - 1) ([^] - l)) . 

Proof: For an optimal {r,5)i code, the largest gap (see (flOl) ) = k + {\^~\ — 1){5 — I). Thus 
there are exactly [k + \ — — 1)) — A; = ([^] — \){5 — 1) generalized dual distances rff such 
that di <k + { \^ - l)(5 - 1). Hence 



d 



9t + ^ 



(24) 



and the corollary follows. 



B. Optimality of Pyramid Codes for Information Locality 

We will now show that for the case 6 < d, under a suitable choice of parameters. Pyramid codes l|2l 
achieve the bound in Theorem [2] with equality. 

Consider an [k + d — 1, k, d] systematic MDS code over ¥g having generator matrix of the form 

G = [ hxk I Qkx(d~i) ] ■ (25) 

We will now proceed to modify G to obtain the generator matrix for an optimal(r, 5)i code. Let k = 
ar + /3,0 < (3 < (r — 1) and S < d. We now partition Q into submatrices as shown below: 



Q 



' Qi 






Q' 


Qa 




. Qa+l 





(26) 



where Qi,l < i < a are matrices of size r x [5—1), Qa+i is of size /3 x (5 — 1) and Q' is a /c x (d — 5) 
matrix. Consider a second generator matrix G' obtained by splitting the first {5 — 1) columns of Q as 
shown below: 



G' 



'/3 



Qi 



Note that G" is a x n full rank matrix, where 

n = k + d — 1 



Qa 

Qa+l 

1){5-1). 



Q' 



(27) 



(28) 



Clearly, by comparing the matrices G and G' , it follows that the code, C, generated by G' has minimum 
distance no smaller than d. Furthermore, C is an (r, 5)i code. Hence, it follows from (l28l) that C is an 
optimal (r, 5)i code. 



C. The structure of an optimal {r,6)i code, when r\k 

In this section, we will assume that r\k. We borrow notation and intermediate steps used in the proof 
of Theorem [2l 

Theorem 5: If an [n, k, d] linear code C having information locality (r, 6) achieves the bound in Q 



with equality, then | Si 



r + S-1, SinSj = (j), 1 < 2 < j < a and (C^)'^' is MDS, 1 < i < a, where 
a is as defined together by (fT5l) and (fT6l) . 

Proo/.- Since, from (1121). we have a > ^, we get that dim (VF*) > ^(5-1) and k\ < ^(r + 5-1), 
where Wk and are as defined in (flTI) . But from Corollary ^ substituting i = 5 — 1, we get that 



= k + ^{6 — 1). Hence, it must be true that 



and 



Now, since Wi G \a\, \SA < r 



dim{Wk 



k 



-(5-1) 
r 



S -1, from dlOl), it follows that IS",] = r + 5 - 1 and 
Si n Sj = (/), 1 < i < j < a. 



(29) 



(30) 



(31) 



Combining dlH) and dlB, we also get that dim((C-^)^') = 6 e [a]. This implies that the dual of 

(C-*-)"^', which is the code, C l^-, has dimension IS*,] — (5 — 1) = r. Now, noting C I5. has parameters 
[r + 6 - l,r,d], it follows that C \s, and hence (C-^)^' are MDS, Vz e [a]. U 

Theorem 6: If an [n, k, d] linear code C having information locality (r, 6) achieves the bound in ^ 
with equality and d < r + 26 — 1, then S < d and up to a reordering of columns, the parity check 
matrix, H of C can be assumed to be of the form: 



(32) 





Qi 


h-i 













H = 




h-i 






A 





h-s 



where 



A 



Ai I A2 



Ak 



and V? G [^] , the matrix 





h-l 










Id-5 



(33) 



(34) 



generates an [r + d — l,d — l,r + 1] MDS code. The matrices Qi and Ai appearing above are of sizes 
(5 — 1) X r and {d — 5) x r respectively. 

Proof: We will prove Theorem [6] in two steps. For an optimal (r, S)i code, we will show that 
• Step 1: a = - and 5 < d. These, along with Theorem [5] will directly mean that the matrix H has 



(35) 



the form given in (1321) . 

Step 2: Secondly, we will show that Vz G [^] , the matrix 



' Qi 


h-l 





A, 





h-5 



generates an [r + d — l,d — l,r + \] MDS code. 



Proof of Step 1: Let a > 



1. Then, from Theorem [5] it follows that 

1) 

1) 



n > a{r + 5 

> (- + l)(r + 5 



(36) 



k 

= k + -(6-l)+r + 6 
r 

> k + d-S + -{S-l), 
r 



(37) 



where (l37l) follows from the assumption that d < r + 26 — 1. But (l37l) contradicts the assumption the 
code is optimal (see ^) and hence a = ^. 

Next, in order to show that 6 < d, first note from (l36l) that the length of an optimal (r, 6)i code must 
be at least -(r + 5 — 1). But, if one assumes S > d, then from we get that, under optimality. 



n 



d + k 



< 6 + k 
k. 

= -(r + 
r 



'--1 

r 
k 

- - 1 

r 



6 



which results in a contradiction. Hence we conclude that, under optimality, S < d. 

Proof of Step 2: From Theorem \5\ and step 1, we get that the parity check matrix, H, for the code 



C has the form (up to permutation of columns) given in (l32l) . Equivalently, the generator matrix, G, of 
C, which, up to a permutation of columns is of the form 



G 



Qi 



(38) 



Let T denote the index set for the last d — 5 columns of G (i.e., the columns corresponding to AjS) 
and consider a shortened code of C, where the 5" = 5*1 U T. Note that is generated by the matrix 
Gs = [Ir I Qi I ^i] and hence C'^ is an [r + c? — 1, r, ds] code, where ds denotes the minimum distance 
of . Clearly, ds > d, the minimum distance of the code C, since shortening a code only increases 
the minimum distance. This means that has parameters [r + d — l,r,d], i.e., is MDS and so is 
its dual. ■ 



Corollary 7: If r\k, d < r + 25 — 1 and equality is achieved in Q, then 

dj- = r + i l<i<S-l. 



(39) 



IV. Codes with All-Symbol Locality 

In this section, we study (r, S)a codes for the case when (r + 6 — l)\n and 5 < d. Firstly, for the case 
when n = [^] (r + 5 — 1), we will give an explicit construction of a code with all-symbol locality by 
splitting this time, rows of the parity check matrix of an appropriate MDS code. We will refer to this 
as the parity-splitting construction. The code so obtained is optimal with respect to We will also 
show the existence of optimal codes with all-symbol locality without the restriction n = \^~\{r + 5 —\). 
The proof of this theorem uses random coding arguments similar to those used for proving Theorem 
17 in UJ. 



A. Explicit and Optimal (r, 6) a Codes via Parity-Splitting 

Theorem 8: Let n = (r + 5 — 1) and 5 < d. Then, for q > n, there exists an explicit and optimal 
(r, 5) a code over ¥q. 

Proof: Let H' be the parity check matrix of an [n,k',d] Reed-Solomon code over Fg, where 
k' = k + {\!^] - 1){6 - 1) and d = n - k' + 1 = n - k + 1 - {\^,] - 1)(5 - 1). Such codes exist if 
q > n. We choose H'^ri-k')xn ^''^ ^ Vandermonde matrix. Let 



H' 



A 



Q{s~ 



1) xn 



(n— fc'+l— (5)xn 



(40) 



We partition the matrix Q in terms of submatrices as shown below 

Q= [Qi I Q2 1 ... I g^i^] , (41) 

where Qi, 1 < ^ < [^] are matrices of size 6 — 1 x (r + 6 — 1). Next consider the code C whose parity 
check matrix, H, is obtained by splitting the first 6—1 rows of H' as follows: 



H 



Qi 



(42) 



Due to the Vandermonde structure of H', all rows of H are linearly independent. Thus Rank(i?) = 
n-k' + {\!^] -1){6-1). Thus dim(C) = k'-{\^] -1){6-1) = k. It is also clear from the construction 
that this code is an (r, 6)a code. 

Let dmin be the minimum distance of C. Since any set of column^ of H which are linearly dependent 
are also linearly dependent in H', dmin > d = n — k + 1 — {\!^] — 1){5 — 1). But, by we must have 



< d. Hence dr, 



d. 



Remark 1: In the above construction, let k = ar + f3. Let 6k = r 

d-6 = n - k - {\-]){6 - 1) 

r 

k + 6k , 
= r k 



13. Then 



r 



6k- 



(43) 



In particular if r|A;, 5^ = and hence d = 6. 



B. Existence of Optimal (r, 6) codes with All-Symbol Locality 

Here, we will state a couple of definitions and a lemma from |[T1, which will be useful in proving 
the existence of optimal codes with all symbol locality. 

Definition 3 (k-core Let L be a subspace of and S C [n] be a set of size k. S is said to be 
a A;-core for L if for all vectors v E L, Supp(v) ^ S. 

5" is a A;-core of a linear code C if and only if the k columns of the generator matrix of the dual code 
corresponding to S are linearly independent. 

Definition 4 (Vectors in General Position Subject to L j^Tjl): Let L be a subspace of F^. Let G = 
[gi, ■ ■ ■ , g„] be a A; X n matrix over Fg. The columns of G, {gj}[L^ are said to be in general position 
subject to L if: 



^set here indicates indices of tlie columns 



Row space of G, denoted by Row(G) C 
For all A;-cores S of L, we have Rank(G'|5) 



k. 



Lemma 9 (Lemma 14 of Let n,k,q be such that q > kn^. Let L be a subspace of and 
< A; < n — dim(L). Then 3 a set of vectors {gj}[L]^ in that are in general position subject to L. 

Using the above lemma, we will now prove the existence of optimal (r, 5) codes for a general set of 
parameters. 

Theorem 10: Let q > kn^, (r + 5 — l)|n and 5 < d. Then there exists an optimal (r, 5)a code over 

Proof: Proof is similar to the proof of Theorem 17 of [IJ. Let n = (r + 5 — l)t. Let {Fi, ■ ■ • , Pt] 
be a partition of \n], where \Pi\ = r + 5— 1,1 < i < t. Let Qi be the parity check matrix of an 
[r + 5 — 1, r, 5] MDS code with support Pj. Consider 

-Qi 



m 



t(5-l)xn 



Q2 



(44) 



Let L = Rowspace(if'). Since 5 < d, ^ gives that 

^k 



n — k > 

Rearranging the above equation, we get 



1) > -{6 
r 



k<n-t{6-l) 



(45) 



which implies the existence of fc-cores for L exist. Thus, from Lemma |9l 3 {gij^^i, gi E F^ which are 
in general position subject to L. Now consider the code C whose generator matrix Gkxn = [gi ■ ■ 'gnj- 
Clearly, C is an (r, 6) a code, whose length is n and dimension is k. It remains to prove that dmin{C) = d, 
given by the equality condition in Q. Towards this, we will show that (see next sub section) for any 
set S C [n] such that Rank(G'|5) < /c — 1, it must be true that 



|^|<A;-l + (5-l)( [^1-1 



Now the minimum distance of C is given by 

d 



n 



max 1^1 > n - A; + 1 - r-1 - 1 

SC[n] \ r 

Rank(G|s)<fc-l 



.k. 



(S-l). 



(46) 



(47) 



Combining the above equation and (|9l), it follows that the code C has the distance given in the theorem 
statement. ■ 

1) Proof of (|46l) .- Let S C [n] be such that Rank(G'|s) < k — 1. Clearly, S does not contain a A;-core. 
Also, note that any K C [n], \ K\ = k, is a k-core for L if and only if 



\PinK\<r \/ IE [t]. 

Thus there exists some i E [t] such that |Fi fl 5*1 > r + 1. 
Define 

bi := \{i E[t]\ \PinS\=r + i}\ 1<£<6-1. 



(48) 



For 1 < i < 6 — 1, consider the set, Si, obtained from S by dropping i elements of S from each of the 

bi sets {Pi\ iPiC] S\ = r + i}. Clearly, the set Ui<e<s^iSi is an \S\-b1-2b2 (5 - 1)65-1 core 

and thus 

<5-l 

IS"! -iS-l)iY,b^)<\S\-bl-2b2 {5- l)bs^i <k-l. (49) 

Also if we pick r co-ordinates from each Ri such that |Pj fl 51 > r + 1, we get a (?^)(X]f=i 6j)-core. 
Thus, 

5-1 

Combining (|49l) and (|50l) . we have 



-lV(5-l). (51) 



k-1 




k 


r 




r 



1. 



(50) 



\S\<k-l 



V. An upper bound on the minimum distance of concatenated codes 

Consider a (serially) concatenated code (see |[8l, flU) having an [ni, ki, di] code A as the inner code 
and an [712, k2, ^2] code B as the outer code. Clearly, a concatenated code falls into the category of an 
{r,d)a code with 6 = di, r = ni — di + 1. Hence, the bound in ^ applies to concatenated codes as 
well. Using the fact that a concatenated code has length n = nin2, dimension k = kik2, we obtain 
from the results of the present paper, the following upper bound on minimum distance d: 

kik2 



d < ni?7,2 — ^1^2 + 1 



l]{di- 1). (52) 



rii — di + 1 

Well known bounds on the minimum distance of a concatenated codes are 

did2 < d < nid2. (53) 

In practice, concatenated codes often employ an interleaver between the inner and outer codes in order 
to increase the minimum distance ifTOll . In this case, while the upper bound in (|53l) no longer holds, the 
bound in (|52|) continues to hold. 

An asymptotic version of (|52l) can be obtained, if we assume that both the component codes are MDS. 
Let R = ^ and A = ^, respectively, denote the rate and the fractional distance of the concatenated 
code. Similarly let Ai, i?2, A2 denote the corresponding parameters of the component codes. We 
consider the asymptotic case when both ni and n2 tend to infinity. Also assume that di,ki increase in 
proportion with n,, i = 1, 2. Using all these fractional parameters in (|52|) . we get 

R2 di-1 

A < 1 - R1R2 - Aii?2 + — + . (54) 

nx n\n2 

In the limit — 00, i = 1,2, singleton bound gives us i?j = 1 — A^. Using this fact, asymptotically 
( |541 ) becomes 

A <1-R2 = 1-^. (55) 
We remark that this is the same asymptotic bound that one would get from (l53l) . 
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